Abstract


The  problem  of  time  to  first  failure  for  repairable  coherent  systems 
of  independent  exponential  components  is  discussed.  Several  inequalities 
are  derived  and  related  to  previous  work  of  the  author  and  of  Keilson  to 
obtain  approximations  with  error  bounds  for  the  distribution  of  the  time 
to  first  failure. 
1.  Introduction

The  need  for  quantitative  methods  to  assess  system  reliability  has 
surfaced  in  evaluating  such  diverse  systems  as  nuclear  power  plants, 
automatic  landing  systems  and  chemical  plants.  The  preliminary  steps  to 
system  reliability  assessment  involve  identifying  the  various  modes  of 
system failure and assigning parameter values to the failure and repair
distributions.  A  valuable  tool  in  the  identification  of  system  failure 
is  fault  tree  analysis,  a  topic  of  considerable  research  interest  in  the 
engineering  literature. 

Here,  my  concern  is  with  the  quantitative  analysis  of  reliability 
for  systems  with  repairable  components.  This  usually  follows  qualitative 
analysis,  fault  tree  construction  and  assignment  of  parameters.  From  this 
input  one  desires  to  calculate  system  characteristics,  the  most  important 
of  which  is  the  distribution  of  the  time  to  first  failure. 

In this paper some ideas relevant to the study of time to first
failure are discussed. In particular, the work of Keilson ([10], [11])
for general coherent systems is combined with that of Brown ([4], [5]) and
with results derived here to obtain approximations with error bounds for
the reliability of coherent systems.

Sections  2-6  contain  background  material  and  mathematical 
methodology. The main inequalities are presented in Section 7. A
numerical  example  illustrating  the  precision  of  the  approximation  for 
the  parallel  case  is  presented  in  Section  8. 
2.  Description of the Mathematical Model


The  model  which  I  will  discuss  appears  extensively  in  the 
literature  (Barlow  and  Proschan  [1],  Esary  and  Proschan  [8],  Brown  [4], 
Keilson  ([10],  [11]),  Ross  [15]).  The  system  has  n  independent  components. 
Each  alternates  between  independent  working  periods  (often  called  up 
periods)  and  repair  (down)  periods.  The  distribution  of  the  up  and  down 
periods  may  vary  from  component  to  component.  Define: 


$X_i(t) = \begin{cases} 1 & \text{if component } i \text{ is up at time } t \\ 0 & \text{if component } i \text{ is down at time } t \end{cases}$

and

$X(t) = (X_1(t), \ldots, X_n(t))$.


The system is up at time t if and only if $X(t) \in G$, a subset of
the state space S (S contains the $2^n$ n-tuples of 0's and 1's). The
system is down at time t if and only if $X(t) \in B$, B being the complement
of G in S. The set G depends on the structure of the system. For
example a k out of n system is one which is up if and only if at least
k of its n components are up. The set G in this case consists of
all points in S with k or more 1's while B consists of all points in
S with k-1 or fewer 1's.

The k out of n system is an example of a coherent system. A
coherent system is defined by the property that $x \in G$ and $y \ge x$
(meaning $y_i \ge x_i$, $i = 1, \ldots, n$) implies $y \in G$. Thus a working system cannot
go out of order as a result of repair of one of its down components.
The assumption of coherency is quite reasonable and plays an important
role in the analytic treatment of the model.
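
The definitions of G, B and coherence can be checked mechanically for small n. The following Python sketch is an illustration only; the function names `k_out_of_n_up_set` and `is_coherent` are hypothetical and do not come from the paper. It enumerates the state space of a k out of n system and tests the coherence property by repairing one down component at a time, which suffices because any $y \ge x$ is reached from x by a sequence of such single repairs.

```python
from itertools import product


def k_out_of_n_up_set(n, k):
    """Return G: all 0/1 n-tuples with at least k ones (system up)."""
    return {x for x in product((0, 1), repeat=n) if sum(x) >= k}


def is_coherent(G, n):
    """Check that x in G and y >= x (componentwise) implies y in G.

    Only single-component repairs are tested; larger increases follow
    by applying single repairs one after another.
    """
    for x in G:
        for i in range(n):
            if x[i] == 0:
                y = x[:i] + (1,) + x[i + 1:]  # repair down component i
                if y not in G:
                    return False
    return True


n, k = 3, 2
S = set(product((0, 1), repeat=n))  # full state space, 2**n points
G = k_out_of_n_up_set(n, k)         # up states
B = S - G                           # down states (complement of G)
print(len(S), len(G), len(B))
print(is_coherent(G, n))
```

For a structure that is not coherent, for example an up set consisting of the single state (1, 0) with n = 2, `is_coherent` returns False because repairing the second component would take the system out of G.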


A further assumption which is often made and which will be
followed here is that both up and down periods are exponentially
distributed. Under this assumption the process $\{X(t), t \ge 0\}$ is a Markov
process. As a result the mathematical analysis is simplified, although
formidable difficulties remain. The problem of robustness of results
derived under exponentiality is of obvious importance.


3.  Construction of $\{X(t), t \ge 0\}$

Consider a single component with exponential up times with
parameter $\lambda$ and exponential down times with parameter $\mu$. A convenient
construction for $\{X(t), t \ge 0\}$, the zero-one process representing the
component state, will now be discussed. It was employed in Brown [4]
for the study of parallel systems, and is apparently quite well known
in other contexts.

Take two independent Poisson processes, process 1 with parameter
$\lambda$ and process 2 with parameter $\mu$. Call the superimposed Poisson process
$\{N(t), t \ge 0\}$ and label its event epochs by $T_1, T_2, \ldots$. Define
$X(t) = X(0)$ if $N(t) = 0$; otherwise observe $T_{N(t)}$, the last event from
the superimposed process at or prior to time t. Set $X(t) = 0$ if $T_{N(t)}$
came from process 1, and $X(t) = 1$ if $T_{N(t)}$ came from process 2.

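Clearly the constructed process is an alternating renewal process
with exponential holding times with parameters $\lambda$ and $\mu$, and is thus a
representation of component behavior. An event from process 1 while the
component is already down, or from process 2 while it is already up, leaves
the state unchanged, so up periods end at rate $\lambda$ and down periods end
at rate $\mu$.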

Next,  perform  this  construction  independently  for  the  n  components, 
letting  X(0)  have  whatever  initial  distribution  is  of  interest.  We  thus 
have a version of $\{X(t), t \ge 0\}$.

Applications of this construction will follow in Sections 4, 5
and 6.
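
As an illustration of the construction, the following Python sketch simulates a single component by the superposition device. It is a sketch under stated assumptions rather than anything taken from the paper: the function names are hypothetical, and the two Poisson processes are merged into one process of rate lam + mu whose events are independently labelled "process 1" with probability lam / (lam + mu), a standard equivalent description of a superposition. For large t the estimated probability of being up should be close to mu / (lam + mu).

```python
import random


def component_state(lam, mu, x0, t_end, rng):
    """Return X(t_end) for one component built from the superposed process.

    Each event sets the state to 0 (process 1, failure) or 1 (process 2,
    repair), regardless of the current state; with no event before t_end
    the state is still x0.
    """
    t, state = 0.0, x0
    while True:
        t += rng.expovariate(lam + mu)  # next epoch of the superposed process
        if t > t_end:
            return state
        state = 0 if rng.random() < lam / (lam + mu) else 1


def fraction_up(lam, mu, t_end, runs, seed=0):
    """Monte Carlo estimate of Pr(X(t_end) = 1 | X(0) = 1)."""
    rng = random.Random(seed)
    ups = sum(component_state(lam, mu, 1, t_end, rng) for _ in range(runs))
    return ups / runs


# Illustrative parameters (hypothetical, chosen so that the run is fast).
print(fraction_up(lam=1.0, mu=3.0, t_end=5.0, runs=20000, seed=1))
print(3.0 / (1.0 + 3.0))  # steady state probability of being up
```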


4.  Stochastic Monotonicity


Suppose we want to compare the process X(t) under two different
initial states $\alpha$ and $\beta$, with $\alpha \le \beta$ ($\alpha_i \le \beta_i$, $i = 1, \ldots, n$). Call the
resulting processes $X_\alpha(t)$ and $X_\beta(t)$. We construct a bivariate version
$\{(X_\alpha(t), X_\beta(t)), t \ge 0\}$ by setting $X_\alpha(0) = \alpha$, $X_\beta(0) = \beta$ and having
the transition mechanism (the 2n independent Poisson processes discussed
in Section 3) be identical for both $X_\alpha$ and $X_\beta$. It immediately follows
that $X_\alpha(t) \le X_\beta(t)$ for all t under the constructed version. It
further follows that for any decreasing set B ($x \in B$ and $y \le x$ implies $y \in B$),
$T_\alpha(B)$, the first passage time to B starting in $\alpha$, is stochastically
smaller than $T_\beta(B)$. Thus for a coherent system (i.e. for B decreasing)
the time to first failure is stochastically increasing in the initial
state (under the partial ordering $\alpha \le \beta \Leftrightarrow \alpha_i \le \beta_i$, $i = 1, \ldots, n$). A consequence
is that $T_{\mathbf{1}}(B)$ is NBU (where $\mathbf{1} = (1, \ldots, 1)$), a result of Ross [15].

Furthermore $\Pr(X(t) \in B \mid X(0) = \mathbf{1})$ is increasing in t.
It is not known whether $T_{\mathbf{1}}(B)$ is IFRA. Brown and Rao [6]
show  that  the  first  passage  time  to  a  decreasing  set  for  a  stochastically 
monotone  Markov  chain  with  monotone  paths  on  a  partially  ordered  finite 
state  space  is  IFRA,  but  in  the  present  case  the  paths  are  not  monotone. 
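
The coupling argument of this section can be illustrated by simulation. The Python sketch below is a hypothetical illustration, not code from the paper: it drives two copies of the process, started at ordered states alpha <= beta, with the same events, and records the first passage times to the decreasing set B = {fewer than k components up} of a k out of n system. The function name and the parameter values are assumptions made for the example.

```python
import random


def coupled_first_passage(lams, mus, alpha, beta, k, rng, t_max=1e6):
    """First passage times (T_alpha, T_beta) to B = {fewer than k up}.

    Both copies use the SAME event stream: each event picks a component i
    with probability proportional to lams[i] + mus[i] and sets it to 0
    (process 1) or 1 (process 2) in both copies.
    """
    xa, xb = list(alpha), list(beta)
    rates = [l + m for l, m in zip(lams, mus)]
    total = sum(rates)
    t = 0.0
    ta = 0.0 if sum(xa) < k else None
    tb = 0.0 if sum(xb) < k else None
    while tb is None and t < t_max:
        t += rng.expovariate(total)
        i = rng.choices(range(len(rates)), weights=rates)[0]
        new_state = 0 if rng.random() < lams[i] / rates[i] else 1
        xa[i] = xb[i] = new_state
        # The componentwise ordering is preserved by every shared event.
        assert all(a <= b for a, b in zip(xa, xb))
        if ta is None and sum(xa) < k:
            ta = t
        if sum(xb) < k:
            tb = t
    return ta, tb


rng = random.Random(3)
# Hypothetical 2 out of 3 system with unit failure and repair rates.
pairs = [coupled_first_passage([1.0] * 3, [1.0] * 3, (1, 1, 0), (1, 1, 1), 2, rng)
         for _ in range(5)]
print(all(ta <= tb for ta, tb in pairs))
```

On every sample path the copy started in the smaller state reaches B no later than the copy started in the larger state, which is the pathwise form of the stochastic ordering of the first passage times.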


5.  Comparison  of  Perfect  State  and  Steady  State 

Returning to the construction of X(t) given in Section 3, define
a component i activity to mean that an event takes place from either of
the two Poisson processes associated with component i. Note that a
component i activity does not necessarily imply a component i change of
state. The waiting time until activity from component i is clearly
exponential with parameter $\lambda_i + \mu_i$. Let $Z_i$ denote this random time.
Note that $X_i(Z_i + s)$ has the steady state distribution of $X_i(t)$ for
all $s \ge 0$, independent of the initial state. Defining $Z = \max_{1 \le i \le n} Z_i$ it
follows that $X(Z + s)$ has the system steady state distribution
regardless of the initial system state. Thus system equilibrium, which
is not attained for any fixed t, is attained at the random time Z.
The random variable Z is distributed as the maximum of n independent
exponential random variables with parameters $\lambda_i + \mu_i$, $i = 1, \ldots, n$.
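
Because Z is a maximum of independent exponential random variables, its cdf and mean have simple closed forms. The Python sketch below is an illustration with hypothetical function names; it uses the product form of the cdf and the standard inclusion-exclusion formula for the mean of a maximum of independent exponentials. The rates in the example call are illustrative values only.

```python
from itertools import combinations
import math


def cdf_Z(t, rates):
    """F_Z(t) = prod_i (1 - exp(-r_i t)), with r_i = lambda_i + mu_i."""
    return math.prod(1.0 - math.exp(-r * t) for r in rates)


def mean_Z(rates):
    """E[Z] by inclusion-exclusion over nonempty subsets of components:
    E[max] = sum over subsets S of (-1)**(|S|+1) / sum(r_i for i in S)."""
    total = 0.0
    for size in range(1, len(rates) + 1):
        sign = (-1) ** (size + 1)
        for subset in combinations(rates, size):
            total += sign / sum(subset)
    return total


# Three identical components with lambda = 0.01 and mu = 1 (illustrative).
rates = [0.01 + 1.0] * 3
print(mean_Z(rates))
print(cdf_Z(5.0, rates))
```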

Denote by $T_E^*(B)$ the first passage time to B starting from the steady state
distribution restricted to G, by $T_E(B)$ the first passage time to B starting
from the unrestricted steady state distribution, and let q equal the steady state
probability of G. Note that $\Pr(T_E > t) = q \Pr(T_E^* > t)$ for $t \ge 0$.
We suppress the set B in the notation when there is no room for ambiguity.

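By the stochastic monotonicity, $T_{\mathbf{1}} \stackrel{st}{\ge} T_E^*$; by the above construction
$T_{\mathbf{1}} \stackrel{st}{\le} Z + T_E$, where Z and $T_E$ are independent. Thus:

(1)  $T_E^* \stackrel{st}{\le} T_{\mathbf{1}} \stackrel{st}{\le} T_E + Z$

with Z and $T_E$ independent and $\Pr(T_E > t) = q \Pr(T_E^* > t)$.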

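Consider the parallel system ($B = \{\mathbf{0}\}$). Here, starting in state
$\mathbf{1}$, a visit to B prior to time Z is impossible, since until time Z at least
one component has had no activity and is therefore still up. Thus:

(2)  $T_{\mathbf{1}}(\mathbf{0}) \stackrel{st}{=} Z + T_E(\mathbf{0})$

Result (1) is new, while result (2) was obtained by Brown [4]. These
inequalities are applied in Section 7.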


6.  Parallel  System 

In  this  section  we  present  an  improved  and  simplified  derivation 
of  some  of  the  author's  results  (Brown  [4]  )  for  parallel  systems. 

Return to the construction of the 0-1 process for a single
component, with $X(0) = 1$. Recall that as soon as an event occurs from
the superimposed Poisson process, the process X(t) enters steady state.
Thus $P_{10}(t) = \Pr(X(t) = 0 \mid X(0) = 1)$ equals the steady state probability
of state 0, $\lambda/(\lambda+\mu)$, multiplied by the probability that equilibrium is
achieved prior to time t, $1 - e^{-(\lambda+\mu)t}$. Thus:

(3)  $P_{10}(t) = \frac{\lambda}{\lambda+\mu}\left(1 - e^{-(\lambda+\mu)t}\right)$

The result (3) is well known. It is usually derived by differential
equations.
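
As a check on the closed form for the single-component transition probability, the following Python sketch compares it with a numerical solution of the corresponding differential equation, dP/dt = lam - (lam + mu) P with P(0) = 0. This is an illustration only: the function names are hypothetical, and the fourth-order Runge-Kutta scheme is simply one convenient integrator, not a method used in the paper.

```python
import math


def p10_closed_form(t, lam, mu):
    """Steady state probability of state 0 times Pr(equilibrium reached by t)."""
    return lam / (lam + mu) * (1.0 - math.exp(-(lam + mu) * t))


def p10_ode(t, lam, mu, steps=10000):
    """Integrate dP/dt = lam - (lam + mu) * P, P(0) = 0, with classical RK4."""
    h = t / steps

    def f(p):
        return lam - (lam + mu) * p

    p = 0.0
    for _ in range(steps):
        k1 = f(p)
        k2 = f(p + 0.5 * h * k1)
        k3 = f(p + 0.5 * h * k2)
        k4 = f(p + h * k3)
        p += h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return p


# Illustrative parameters: failure rate 0.01, repair rate 1.
lam, mu, t = 0.01, 1.0, 2.0
print(p10_closed_form(t, lam, mu), p10_ode(t, lam, mu))
```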


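It immediately follows from (3) and the independence of the components that:

(4)  $P_{\mathbf{1},\mathbf{0}}(t) = \prod_{i=1}^{n} \frac{\lambda_i}{\lambda_i+\mu_i}\left(1 - e^{-(\lambda_i+\mu_i)t}\right)$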


Recall the random variable Z defined in Section 5. Let $F_Z$ denote
its cdf. Furthermore, note that $\prod_{i=1}^{n} \frac{\lambda_i}{\lambda_i+\mu_i}$ is the steady state probability
of state $\mathbf{0}$, which we denote by p. Thus from (4):

(5)  $P_{\mathbf{1},\mathbf{0}}(t) = p F_Z(t)$.

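Define $\psi_{\mathbf{1},\mathbf{0}}$ to be the Laplace transform of $P_{\mathbf{1},\mathbf{0}}$
(that is, $\psi_{\mathbf{1},\mathbf{0}}(a) = \int_0^\infty e^{-at}\, dP_{\mathbf{1},\mathbf{0}}(t)$), and $\psi_Z$ to be
the Laplace transform of Z. It follows from (5) that:

(6)  $\psi_{\mathbf{1},\mathbf{0}}(a) = p\,\psi_Z(a)$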


Next, define W to be the waiting time, starting in steady state
with the system known to be working, for the first activity from one of
the working components (recall that a component activity is an event from
the superimposed Poisson process governing that component). Thus, W is
conditionally exponentially distributed with parameter $\sum_{i \in a}(\lambda_i+\mu_i)$, given
that a is the set of working components. The probability of a given
steady state restricted to G is given by

$p^*(a) = \frac{1}{q}\left(\prod_{i \in a}\frac{\mu_i}{\lambda_i+\mu_i}\right)\left(\prod_{i \notin a}\frac{\lambda_i}{\lambda_i+\mu_i}\right)$

where $q = 1-p$, p defined above. Thus:

(7)  $\Pr(W > t) = \sum_{a} p^*(a)\, e^{-t\sum_{i \in a}(\lambda_i+\mu_i)}$

where the sum is over all nonempty sets a of working components.


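From (3) and (7) we obtain:

(8)  $P_{\mathbf{0},\mathbf{0}}(t) = \prod_{i=1}^{n}\left[\frac{\lambda_i}{\lambda_i+\mu_i} + \frac{\mu_i}{\lambda_i+\mu_i}\,e^{-(\lambda_i+\mu_i)t}\right] = q\Pr(W > t) + p$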
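
The identity between the product form of the return probability to the all-down state and the mixture form q Pr(W > t) + p can be verified numerically by expanding the product over all sets of working components. The Python sketch below does this for arbitrary rates; the function names and the parameter values are hypothetical and serve only as an illustration.

```python
from itertools import product
import math


def p_down(lams, mus):
    """Steady state probability p that every component is down."""
    return math.prod(l / (l + m) for l, m in zip(lams, mus))


def p00_product(t, lams, mus):
    """Product form: each component is down at 0 and down again at t."""
    return math.prod(
        l / (l + m) + m / (l + m) * math.exp(-(l + m) * t)
        for l, m in zip(lams, mus)
    )


def survival_W(t, lams, mus):
    """Pr(W > t): mixture over nonempty sets of working components."""
    q = 1.0 - p_down(lams, mus)
    total = 0.0
    for state in product((0, 1), repeat=len(lams)):
        if not any(state):
            continue  # the all-down state is excluded (restriction to G)
        weight = math.prod(
            m / (l + m) if up else l / (l + m)
            for up, l, m in zip(state, lams, mus)
        )
        rate = sum(l + m for up, l, m in zip(state, lams, mus) if up)
        total += (weight / q) * math.exp(-rate * t)
    return total


# Illustrative, non-identical rates.
lams, mus = [0.5, 1.0, 2.0], [1.0, 3.0, 0.7]
p = p_down(lams, mus)
for t in (0.0, 0.3, 2.0):
    print(p00_product(t, lams, mus), (1.0 - p) * survival_W(t, lams, mus) + p)
```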


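It follows from (8) that

(9)  $\psi_{\mathbf{0},\mathbf{0}}(a) = 1 - q\,\psi_W(a)$

where $\psi_{\mathbf{0},\mathbf{0}}(a) = 1 + \int_0^\infty e^{-at} P'_{\mathbf{0},\mathbf{0}}(t)\, dt$, and $\psi_W$ is the Laplace transform of W.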


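Furthermore, since

(10)  $P_{\mathbf{1},\mathbf{0}}(t) = \int_0^t P_{\mathbf{0},\mathbf{0}}(t-s)\, dF_{T_{\mathbf{1}}}(s)$

it follows that:

(11)  $\psi_{T_{\mathbf{1}}}(a) = \frac{\psi_{\mathbf{1},\mathbf{0}}(a)}{\psi_{\mathbf{0},\mathbf{0}}(a)}$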


Combining (5), (8) and (11) we derive the main results:

(12)  $\psi_{T_{\mathbf{1}}}(a) = \frac{p\,\psi_Z(a)}{1 - q\,\psi_W(a)}$

(13)  $\psi_{T_E}(a) = \frac{p}{1 - q\,\psi_W(a)}$

Note that if $N^*$ is geometric with parameter p (i.e. $\Pr(N^* = k) = pq^{k-1}$, $k \ge 1$), $N = N^* - 1$, and
$R = \sum_{i=1}^{N} W_i$ where $\{W_i, i \ge 1\}$ is i.i.d. with Laplace transform $\psi_W$, independent
of N, then $\psi_R(a) = \frac{p}{1 - q\,\psi_W(a)}$. Thus from (12) and (13), $T_E \stackrel{st}{=} \sum_{i=1}^{N} W_i$
and $T_{\mathbf{1}} \stackrel{st}{=} Z + \sum_{i=1}^{N} W_i$, where Z is independent of $\sum_{i=1}^{N} W_i$.

From  (12)  and  (13),  the  moments  are  easily  derived  (Brown  [4]). 
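
As an example of such a moment calculation, the representation of the first passage time from the perfect state as Z plus an independent geometric sum of copies of W gives the mean EZ + (q/p) EW. The Python sketch below evaluates this expression and compares it with a direct simulation of a small parallel system. It is an illustration under stated assumptions: the function names are hypothetical, and the rates in the example (two components with unit failure and repair rates) are chosen only so that failures are frequent enough to simulate quickly.

```python
from itertools import combinations, product
import math
import random


def mean_T1(lams, mus):
    """E[T_1] = E[Z] + (q / p) * E[W] for a parallel system."""
    n = len(lams)
    p = math.prod(l / (l + m) for l, m in zip(lams, mus))
    q = 1.0 - p
    # E[W]: mixture over nonempty sets of working components.
    mean_w = 0.0
    for state in product((0, 1), repeat=n):
        if not any(state):
            continue
        weight = math.prod(
            m / (l + m) if up else l / (l + m)
            for up, l, m in zip(state, lams, mus)
        )
        rate = sum(l + m for up, l, m in zip(state, lams, mus) if up)
        mean_w += weight / q / rate
    # E[Z]: maximum of independent exponentials, by inclusion-exclusion.
    rates = [l + m for l, m in zip(lams, mus)]
    mean_z = sum(
        (-1) ** (size + 1) / sum(subset)
        for size in range(1, n + 1)
        for subset in combinations(rates, size)
    )
    return mean_z + (q / p) * mean_w


def simulate_T1(lams, mus, rng):
    """One sample of the time from all-up to all-down (direct simulation)."""
    n = len(lams)
    up = [1] * n
    t = 0.0
    while any(up):
        rates = [lams[i] if up[i] else mus[i] for i in range(n)]
        t += rng.expovariate(sum(rates))
        i = rng.choices(range(n), weights=rates)[0]
        up[i] = 1 - up[i]  # component i changes state
    return t


lams, mus = [1.0, 1.0], [1.0, 1.0]
rng = random.Random(7)
estimate = sum(simulate_T1(lams, mus, rng) for _ in range(20000)) / 20000
print(mean_T1(lams, mus), estimate)
```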


I have not been able to find a sample path interpretation for the
representation $T_E \stackrel{st}{=} \sum_{i=1}^{N} W_i$. By following the process at suitable activity
epochs I obtain $T_E = \sum_{i=1}^{M} V_i$ where M is a stopping time, $\Pr(M = 0) = p$
and $V_i \sim W$. However I find it surprising that this complicated sum
behaves like a geometric sum of i.i.d. random variables, and cannot prove
it directly (i.e. without Laplace transforms as above).


7.  Bounds for Keilson's Approximation

In addition to the first passage times previously defined, Keilson [10] introduced
$T_V$, the post-recovery exit time. This is obtained by starting the system
in steady state restricted to B, and waiting for the first visit to G;
the waiting time from this visit to G until the next visit to B is, by
definition, the post-recovery exit time. The important facts about $T_V$ are:

(i)  $ET_V$ is computable for the general coherent system, while
$ET_E^*$ and $ET_{\mathbf{1}}$ are only known in special cases.

(ii)  $T_V$ is completely monotone.

(iii)  $T_E^*$ is the stationary renewal distribution corresponding to
$T_V$.

Keilson's insightful approach for the approximation of the
reliability of repairable systems is to employ an exponential distribution
with mean $ET_V$ as an approximation for $T_V$, $T_E^*$ and $T_{\mathbf{1}}$. The quantity $ET_V$ is
easily computable, and Keilson motivates the general principle that for
highly reliable systems $T_V$, $T_E^*$ and $T_{\mathbf{1}}$ should have approximately the same
distribution and that this distribution should be approximately exponential.

The approximate exponentiality and equality of distributions needs
to be quantified. As $T_V$ and $T_E^*$ are completely monotone (i.e. mixtures of
exponential distributions), Keilson looked at the problem of approximating
completely monotone distributions by exponential distributions. He proposed
that for X completely monotone, $\rho = \frac{EX^2}{2(EX)^2} - 1$ be used as a distance
measure between X and an exponential distribution with the same mean.
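
For a finite mixture of exponential distributions the distance measure above is available in closed form, since a mixture with weights w_j and rates r_j has first moment sum w_j / r_j and second moment sum 2 w_j / r_j^2. The Python sketch below is an illustration with a hypothetical function name; it returns 0 for a pure exponential and a positive value for a proper mixture.

```python
def keilson_rho(weights, rates):
    """rho = E[X^2] / (2 (E[X])^2) - 1 for a mixture of exponentials.

    weights: mixing probabilities (assumed to sum to 1)
    rates:   exponential parameters of the mixture components
    """
    m1 = sum(w / r for w, r in zip(weights, rates))
    m2 = sum(2.0 * w / r ** 2 for w, r in zip(weights, rates))
    return m2 / (2.0 * m1 ** 2) - 1.0


print(keilson_rho([1.0], [2.0]))            # pure exponential
print(keilson_rho([0.5, 0.5], [1.0, 2.0]))  # equal mixture of rates 1 and 2
```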

Brown [5] obtained bounds on the distance to exponentiality.
Assume that F is IMRL (increasing mean residual life), a larger class than
completely monotone, and that G is the stationary renewal distribution
corresponding to F. Then:

(14)  $\max\left(\sup_t|\bar F(t) - e^{-t/\mu}|,\ \sup_t|\bar F(t) - \bar G(t)|,\ \sup_t|\bar G(t) - e^{-t/\mu}|,\ \sup_t|\bar G(t) - e^{-t/\mu_G}|\right) \le \frac{\rho}{\rho+1} = 1 - \frac{\mu}{\mu_G}$

where $\bar F = 1 - F$, $\mu$ and $\mu_2$ are the first and second moments of F, and
$\mu_G = \frac{\mu_2}{2\mu}$, the mean of G. Moreover the bound for $\sup_t|\bar F(t) - e^{-t/\mu}|$
and $\sup_t|\bar F(t) - \bar G(t)|$ is sharp even within the subclass of completely
monotone distributions.

Returning to first passage times, we let F be the distribution of
$T_V$ and G the distribution of $T_E^*$. Then from (14):

(15)  $\max\left(\sup_t|\bar F_{T_V}(t) - e^{-t/ET_V}|,\ \sup_t|\bar F_{T_E^*}(t) - e^{-t/ET_V}|,\ \sup_t|\bar F_{T_V}(t) - \bar F_{T_E^*}(t)|,\ \sup_t|\bar F_{T_E^*}(t) - e^{-t/ET_E^*}|\right) \le 1 - \frac{ET_V}{ET_E^*}$

It also follows from (14) (since $T_E^*$ is IMRL) that:

(16)  $\sup_t|\bar F_{T_E^*}(t) - e^{-t/ET_E^*}| \le \frac{\rho_E^*}{\rho_E^*+1}$

where $\rho_E^*$ denotes the Keilson distance of $T_E^*$.

To illustrate the parallel case recall that W is the waiting time
until activity from a working component, starting in steady state restricted
to G (i.e. at least one component is working). The random variable W is
distributed as a mixture of exponential distributions. It is exponential
with parameter $\sum_{i \in a}(\lambda_i+\mu_i)$ with probability $p^*(a)$.

It turns out (Brown [4], p. 386) that:

(17)  $ET_V = \frac{EU}{p}$

Moreover, since $T_E^* \stackrel{st}{=} \sum_{i=1}^{N^*} W_i$ and $EN^* = 1/p$, we have:

(18)  $ET_E^* = \frac{EW}{p}$

Hence:

(19)  $\rho_V = \frac{ET_E^*}{ET_V} - 1 = \frac{EW}{EU} - 1$

Furthermore:

(20)  $\rho_E^* = \frac{E(T_E^*)^2}{2(ET_E^*)^2} - 1 = p\left(\frac{EW^2}{2(EW)^2} - 1\right) = p\rho_W$

Thus the Keilson distance from exponentiality of $T_E^*$ ($\rho_E^*$) equals
p, the steady state probability of system failure, multiplied by $\rho_W$, the
Keilson distance from exponentiality of W. In both (19) and (20) the $\rho$
parameters are explicitly available in terms of $\lambda_i$, $\mu_i$, $i = 1, \ldots, n$.

Finally, inequalities (1) and (2) relate the approximate
exponentiality of $T_E^*$ and $T_V$ to $T_{\mathbf{1}}$. Write $\bar F_T(t) = \Pr(T > t)$ and $F_T = 1 - \bar F_T$. From (1):

(21)  $\bar F_{T_E^*}(t) \le \bar F_{T_{\mathbf{1}}}(t) \le E\bar F_{T_E}(t-Z) \le E\bar F_{T_E^*}(t-Z)$

From (21) we obtain:

(22)  $0 \le \bar F_{T_{\mathbf{1}}}(t) - \bar F_{T_E^*}(t) \le E\bar F_{T_E^*}(t-Z) - \bar F_{T_E^*}(t)$

Since $T_E^*$ is completely monotone its pdf is decreasing. Therefore
the interval of length z with highest probability is (0,z]. Thus from (22):

(23)  $\bar F_{T_{\mathbf{1}}}(t) - \bar F_{T_E^*}(t) \le EF_{T_E^*}(Z)$

But, since $T_E^*$ has a decreasing pdf, $F_{T_E^*}(z)$ is concave and thus:

(24)  $EF_{T_E^*}(Z) \le F_{T_E^*}(EZ)$

Next, since $T_E^*$ is completely monotone it is also DFR (decreasing
failure rate). It follows from Brown [5] that:

(25)  $F_{T_E^*}(EZ) \le 1 - e^{-EZ/ET_E^*} + \rho_E^* \le \frac{EZ}{ET_E^*} + \rho_E^*$

Combining (21) - (25) we arrive at:

(26)  $0 \le \bar F_{T_{\mathbf{1}}}(t) - \bar F_{T_E^*}(t) \le \frac{EZ}{ET_E^*} + \rho_E^*$ for all t.

Furthermore, it follows from (15) and (26) that:

(27)  $\sup_t|\bar F_{T_{\mathbf{1}}}(t) - e^{-t/ET_V}| \le \frac{\rho_V}{\rho_V+1} + \frac{EZ}{ET_E^*} + \rho_E^*$

and from (16) and (26) that:

(28)  $\sup_t|\bar F_{T_{\mathbf{1}}}(t) - e^{-t/ET_E^*}| \le \frac{\rho_E^*(\rho_E^*+2)}{\rho_E^*+1} + \frac{EZ}{ET_E^*}$


Lastly, since $T_E^*$ is DFR and has failure rate $1/ET_V$ at t = 0, it
follows that $\bar F_{T_E^*}(t) \ge e^{-t/ET_V}$. Thus from (21) - (24):

(29)  $\sup_t|\bar F_{T_{\mathbf{1}}}(t) - \bar F_{T_E^*}(t)| \le 1 - e^{-EZ/ET_V} \le \frac{EZ}{ET_V}$

From (15) and (29) we obtain:

(30)  $\sup_t|\bar F_{T_{\mathbf{1}}}(t) - e^{-t/ET_V}| \le \frac{\rho_V}{\rho_V+1} + \frac{EZ}{ET_V}$


8.  Numerical  Example 


Consider a parallel system of three identical components with failure
rate .01 and repair rate 1.


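Here $ET_V = 343{,}433.33$, $ET_E^* = 345{,}181.85$, $\rho_V = .005091$,
$\frac{\rho_V}{\rho_V+1} = .005066$, $\rho_E^* = 10^{-8}$, and $EZ = 1.815182$. Consequently $\bar F_{T_V}(t)$, $\bar F_{T_E^*}(t)$
and $e^{-t/ET_V}$ are all within a distance of .005066 for all t. Furthermore
$\sup_t|\bar F_{T_E^*}(t) - e^{-t/ET_E^*}| \le \rho_E^* = 10^{-8}$, and $\sup_t|\bar F_{T_{\mathbf{1}}}(t) - e^{-t/ET_E^*}| \le 2\rho_E^* + \frac{EZ}{ET_E^*} = .00000528$.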
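
The figures of this example can be recomputed from the component rates. The Python sketch below is an illustration under stated assumptions and not the author's computation: the variable names are hypothetical, the mean of the geometric sum is taken as EW / p, and the post-recovery mean is obtained from a renewal argument (the long-run fraction of time spent in the all-down state, whose sojourns are exponential with rate equal to the sum of the repair rates, must equal p) rather than from formula (17). The computed Keilson distance of the restricted steady state passage time comes out of the order of $10^{-8}$.

```python
from itertools import combinations, product
import math

lam, mu, n = 0.01, 1.0, 3
lams, mus = [lam] * n, [mu] * n

p = math.prod(l / (l + m) for l, m in zip(lams, mus))  # Pr(all down)
q = 1.0 - p

# First two moments of W (mixture over nonempty sets of working components).
ew = ew2 = 0.0
for state in product((0, 1), repeat=n):
    if not any(state):
        continue
    weight = math.prod(
        m / (l + m) if up else l / (l + m)
        for up, l, m in zip(state, lams, mus)
    ) / q
    rate = sum(l + m for up, l, m in zip(state, lams, mus) if up)
    ew += weight / rate
    ew2 += 2.0 * weight / rate ** 2

# E[Z]: maximum of n independent exponentials with rates lambda_i + mu_i.
rates = [l + m for l, m in zip(lams, mus)]
ez = sum(
    (-1) ** (size + 1) / sum(subset)
    for size in range(1, n + 1)
    for subset in combinations(rates, size)
)

et_e_star = ew / p                   # mean of a geometric sum of copies of W
et_v = q / (p * sum(mus))            # ASSUMPTION: renewal argument, see lead-in
rho_v = et_e_star / et_v - 1.0
rho_w = ew2 / (2.0 * ew ** 2) - 1.0
rho_e_star = p * rho_w
bound_common = rho_v / (rho_v + 1.0)            # distance among the three curves
bound_perfect = 2.0 * rho_e_star + ez / et_e_star  # bound for the perfect start

print(round(et_v, 2), round(et_e_star, 2), round(ez, 6))
print(round(rho_v, 6), round(bound_common, 6))
print(rho_e_star, bound_perfect)
```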